Reducing Multicore Bandwidth Requirements for Combinatorial Multigrid
نویسندگان
چکیده
Memory bandwidth is a major limiting factor in the scalability of parallel algorithms. In this paper, we introduce hierarchical diagonal blocking, a sparse matrix representation which we believe captures most of optimization techniques in a common representation. It can take advantage of symmetry while still being easy to parallelize. It takes advantage of, or actually requires, reordering. It also allows for simple compression of column indices. As applications, we show how to use this highperformance SpMV kernel, together with precision reduction techniques, in a combinatorial multigrid solver to lower the bandwidth consumption without sacrificing the final solution’s quality. We provide extensive empirical evaluation of the effectiveness of the approach on a variety of benchmark matrices, demonstrating substantial speedups on all matrices considered.
منابع مشابه
A Fast Multigrid Solver for Molecular Dynamics on the Cell Broadband Engine
In the field of molecular dynamics, Poisson’s equation with open boundary conditions plays a major role. One issue that comes up, when this equation is solved numerically, is the infinite size of the domain. This prevents a direct solution, so that other concepts have to be considered. Within the scope of this thesis, a method is discussed that uses hierarchical coarsened grids to overcome this...
متن کاملOn the Performance of an Algebraic Multigrid Solver on Multicore Clusters
Algebraic multigrid (AMG) solvers have proven to be extremely efficient on distributed-memory architectures. However, when executed on modern multicore cluster architectures, we face new challenges that can significantly harm AMG’s performance. We discuss our experiences on such an architecture and present a set of techniques that help users to overcome the associated problems, including thread...
متن کاملHighly Parallel Multigrid Solvers for Multicore and Manycore Processors
In this paper we present an analysis of parallelization properties and implementation details of the new Algebraic multigrid solvers. Variants of smoothers and multicolor grid partitionings are discussed. Optimizations for modern throughput-oriented processors are considered together with different storage schemes. Finally, comparative performance results for multicore and manycore processors a...
متن کاملFinite Element Computations on Multicore and Graphics Processors
Ljungkvist, K. 2017. Finite Element Computations on Multicore and Graphics Processors. Digital Comprehensive Summaries of Uppsala Dissertations from the Faculty of Science and Technology 1512. 64 pp. Uppsala: Acta Universitatis Upsaliensis. ISBN 978-91-554-9907-5. In this thesis, techniques for efficient utilization of modern computer hardwarefor numerical simulation are considered. In particul...
متن کاملThe Communication Performance of the Cray T3D and its Effect on Iterative Solvers
On many distributed memory systems, such as workstation clusters or the Intel iPSC/860, the multigrid algorithm suffers from having extensive communication requirements and, in general, it is not very competitive in comparison to the conjugate gradient algorithm. This is in contrast to the sequential problem whereby the multigrid algorithm is very effective in reducing the global residual, part...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010